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RELEVANT SAMPLING OF BAND-LIMITED FUNCTIONS 

RICHARD F. BASS AND KARLHEINZ GROCHENIG 

Abstract. We study the random sampling of band- limited functions of several 
variables. If a band-limited function with bandwidth has its essential support on 
CN ' a cube of volume W^, then 0{R'^ log R'') random samples suffice to approximate 

the function up to a given error with high probability. 

(N 

1. Introduction 

The nonuniform sampling of band-limited functions of several variables remains 

q/ . a challenging problem. Whereas in dimension 1 the density of a set essentially 

Ph I characterizes sets of stable sampling [H], in higher dimensions the density is no 

r-| ' longer a decisive property of sets of stable sampling. Only a few strong and explicit 

^ ■ sufficient conditions are known, e.g., [3|[TmiT2]. 

This difficulty is one of the reasons for taking a probabilistic approach to the 
sampling problem [2l|20]. At first glance, one would guess that every reasonably 
homogeneous set of points in M'^ satisfying Landau's necessary density condition 
>> ■ will generate a set of stable sampling. This intuition is far from true. To the best 

of our knowledge, every construction in the literature of sets of random points in 
W^ contains either arbitrarily large holes with positive probability or concentrates 
near the zero manifold of a band-limited function. Both properties are incompatible 
cn . with a sampling inequality. See [2] for a detailed discussion. 

^ ! The difficulties with the probabilistic approach lie in the unboundedness of the 

configuration space W^ and the infinite dimensionality of the space of band-limited 
functions. To resolve this issue, we argued in [2] that usually one observes only 
finitely many samples of a band-limited function and that these observations are 
drawn from a bounded subset of W^. Moreover, since it does not make sense to 
sample a given function / in a region where / is small, we proposed to sample 
/ only on its essential support. Since / is sampled only in the relevant region, 
this method might be called the "relevant sampling of band-limited functions." In 
this paper we continue our investigation of the random sampling of band-limited 
functions and settle a question that was left open in |2], namely how many random 
samples are required to approximate a band-limited function locally to within a 
given accuracy? 
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2 RICHARD F. BASS AND KARLHEINZ GROCHENIG 

To fix terms, recall that the space of band-limited functions is defined to be 

B = {fe L^R'^) : supp / C [-1/2, 1/2]'^} , 

where we have normalized the spectrum to be the unit cube and the Fourier trans- 
form is normalized as /(^) = J^a /(x)e~^''"''^ dx. A set {xj : j G J} C M"^ is called 
a set of stable sampling or simply a set of sampling [7\, if there exist constants 
A,B>0, such that a sampling inequality holds: 

J 

Next, we sample only on the essential support of /. Therefore we let Cr = 

[~R/2,R/2f and define the subset 

B{R,5) = i^feB: ^ \f{x)\'dx>{l-5)\\f\\l^. 

As a continuation of p], we will prove the following sampling theorem. 

Theorem 1. Let {xj : j & 'H} he a sequence of independent and identically dis- 
tributed random variables that are uniformly distributed in Cr. Suppose that R>2, 
that 6 G (0, 1) and v G (0, 1/2) are small enough, and that < e < 1. There exists 
a constant n so that if the number of samples r satisfies 

.1 + Z//3 2R'^ 
(2) r>R'' ^log , 



z/ 



then the sampling inequality 

r 

(3) l-fl-6-u-126K)\\f\\l<Y,\fix,)\'<r\\f\\l for all f e B{R,6) 






holds with probability at least 1 — e. The constant k can be taken to be k = e . 

The formulation of Theorem [1] is similar to [21 Thm. 3.1]. The main point is 
that only 0{R'^\ogR''') samples are required for a sampling inequality to hold with 
high probability. In [2J we used a metric entropy argument to show that 0{R^'^) 
samples suffice. We expect that the order C(i?'^logi?'^) is optimal. We point out 
that in addition all constants are now explicit. 

Our idea is to replace the sampling of band-limited function in B{R, 6) by a finite- 
dimensional problem, namely the sampling of the corresponding span of prolate 
spheroidal functions on the cube [—R/2, R/2Y and then use error estimates. For 
the probability estimates we use a new tool, namely the powerful matrix Bernstein 
inequality of Ahlswede and Winter [1] in the optimized version of Tropp [22]. 

The remainder of the paper contains the analysis of a related finite-dimensional 
problem for prolate spheroidal functions in Section 2 and transition to the infinite- 
dimensional problem in B{R, 5) with the necessary error estimates in Section 3. 
The appendix contains an elementary estimate for the constant k. 
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2. FiNITE-DlMENSIONAL SUBSPACES OF B 

We first study a sampling problem in a finite-dimensional subspace related to 
the set 13{R,6). 

Prolate Spheroidal Functions. Let Pr and Q be the projection operators 
defined by 

(4) PRf = XcRf and <5/ = -7^"^(X[-i/2,i/2]d/) , 

where J-""^ is the inverse Fourier transform. The composition of these orthogonal 
projections 

(5) Ar = QPnQ 

is the operator of time and frequency limiting. This operator arises frequently in 
the context of band-limited functions and uncertainty principles. The localization 
operator A^ is a compact positive operator of trace class, and by results of Landau, 
Slepian, Pollak, and Widom [8l[9l[T71[l9l[23] the eigenvalue distribution spectrum is 
precisely known. We summarize the properties of the spectrum that we will need. 
Let A\^ denote the operator of time-frequency limiting in dimension d= \. This 
operator can be defined explicitly on L^(R) by the formula 

(4)/r(o = £^^ "";(^L^;;'" /w *, ^ ki < 1/2 . 

The eigenfunctions of A-^ are the prolate spheroidal functions and let the corre- 
sponding eigenvalues /i^ = fik{R) be arranged in decreasing order. According to [B] 
they satisfy 

< fik{R) < 1 VA; G N, 
m+i(i?)<l/2</i[ij]-i(i?); 

As a consequence any function with spectrum [—1/2, 1/2] and "essential" support 
on [—R/2, R/2] is close to the span of the first R prolate spheroidal functions. 
In particular, we may think of B{R, 6) as, roughly, almost a subset of a finite- 
dimensional space of dimension R. 

The time-frequency limiting operator Ar on L'^{M.'^) is the d-io\d tensor product 
of v4)j , Ar = Aj^ ® ■ ■ ■ (g) Aj^ . Consequently, a{AR), the spectrum of Ar, is 

d 

a(A^) = {A G (0, 1) : A = n-"-^.""-^. e ^(Ar)} ■ 

i=i 

Since < /i^ < 1, Ar possesses at most R'^ eigenvalues greater than or equal to 
1/2. Again we arrange the eigenvalues of Ar by magnitude 1 > Ai > A2 > A3 ■ ■ ■ > 
An > A„+i > ■ ■ • > 0. Let (pj be the eigenfunction corresponding to Xj. 
We fix R "large" and 6 G (0, 1). Let 

Vn = span {(pj : j = 1,...,N} 
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be the span of the first A^ eigenfunctions of the time-frequency limiting operator 
Aji (one might call them "multivariate prolate polynomials"). For properly chosen 
A^, Vn consists of functions in B{R, 6). See Lemma [51 
By Plancherel's theorem, 

{Qf,9) = (x[-i/2,i/2]rf/,^) = {f,xi~i/2,i/2]'i9) = {f^Qg)- 

Then for / G i3 we have Qf = f, and so 

(6) {AnfJ) = {PRQf,Qf) = {PbJJ) = I \f{x)\^dx. 

Jcr 

We first study random sampling in the finite-dimensional space Vn- In the 
following ||/||2,R denotes the normalized L^-norm of / restricted to the cube Cr = 

[-R/2,R/2Y:' 

\\f\\lR= I \f{x)\'dx. 
JCr 

Proposition 2. Let {xj : j G N} be a sequence of independent and identically 
distributed random variables that are uniformly distributed in [—R/2,R/2]'^. Then 



(7) P| inf -Yi\fix,)\^-^\\f\\lf,)<-^ 



< A^ exp I - 



2 

u r 



R'^il + Z//3) 
for r eN and u >0. 

Proof. We prove the proposition in several steps. First, since Vn is finite-dimensional, 
the sampling inequality for Vn amounts to a statement about the spectrum of an 
underlying (random) matrix. 

Let / = {c,(f)) = Ylk=i^k(pk G Vn, so that \f{xj)\'^ = Y.k,i=i'^kC'i(l)kixj)(j)i{xj). 
Now define the N x N matrix Tj of rank one by letting the {k, I) entry be 



(8) {Tj)ki = (j)k{xj)(f)i{xj) . 

Then \f{xj)\'^ = {c,Tjc). Since each random variable Xj is uniformly distributed 
over Cr and 0^ is the k-th eigenfunction of the localization operator Ar, using ([6]) 
the expectation of the A;/-th entry is 

(9) E (^{Tj)ki) =^J Mx)M^dx = ^{An(pk,<Pi) 

= -^Afc4i k,l = 1,...,N, 

where 6ki is Kronecker's delta. Consequently the expectation of Tj is the diagonal 
matrix 

(10) E(r,) = -ldiag(Afc)=:i^A. 
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We may now rewrite the expression in (jTj) as 



= inf -V((c,T,c)-(c,E(T,)c)) 
c 2=1 r ^-^ 
i=i 

1 ^ 
(11) =A^in(-5^(T,-E(T,))^ 



j=i 

where we use Amin(f^) for the smallest eigenvalue of a self-adjoint matrix U . 

Consequently, we have to estimate a probability for the matrix norm of a sum of 
random matrices. We do this using a matrix Bernstein inequality due to Tropp |22j . 
Let Amax(^) be the largest singular value of a matrix A so that ||y4|| = \^.^J^A* A)^!'^ 
is the operator norm (with respect to the £^-norm). 

Theorem 3. (Tropp) Let Xj be a sequence of independent, random self-adjoint 
N X N -matrices. Suppose that 

EXj- = and \\Xj\\ < B a.s. 

and let 

i=i 
Then for all t > 0, 

tV2 N 
^jj ^ ty ^ iv exp ^^ - — 

i=i 



(12) p(A.,„(|^.Y,)><)<iVexp(-jJ^). 



To apply the matrix Bernstein inequality, we set Xj = Tj — K {Tj). We need to 
calculate \\Xj\\ and || EjE(^i)ll- Clearly E (X^) = 0. 

Lemma 4. Under the conditions stated above we have 

11^.11 <1, 

E (X|) < R-'^A, 

Ml < 
j=i 

Proof, (i) To estimate the matrix norm of Xj, recall that 

(13) \f{x)\<\\fh ^f^l3. 

Hence we obtain 



J 

r 

and a^ = ||^E(X|)|| < "" 



|Xj|| = sup 

ll/l|2=l 



\f{x,)\'-R-^\f\\in 



<ll/lloo-i?-1l/|ll^<||/||2 = l. 
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(ii) Next we calculate the matrix E(X|): 

E (X|) = E (Tf) - R-'^E (TjA) - R-'^E (AT,-) + i^-^^A^ 

= E (T/) - R-'^E {Tj)A - R-'^AE (T,-) + i^^^^A^ = E (T/) - i^-^^A^ 

Furthermore, the square of the rank one matrix Tj is the (rank one) matrix 

N 

{Tj )km = / ,iTj)kl(Tj)im 
1=1 



y^ (pk {Xj ) (pi {Xj ) (pi {Xj ) (pm {Xj ) 



I 

N 



Ei'^'(^^-)n(r, 



'^jjkm 



1=1 



Writing m{x) = X]«=i 10/(^)1 5 "^^ obtain 

(14) Tf=m{x,)T,, 

Let s be the function whose Fourier transform is given by s = X[_i/2,i/2]'' and let 
Txf{t) = f{t — x) be the translation operator. Then it is well known that T^s is 
the reproducing kernel for B, that is, 

f{x) = {f,T,s). 

To see this, by Plancherel's theorem and the inversion formula for the Fourier 
transform, if / G i3, 

(/, T^s) = (/, e-2--«S) = / e2™-«/(0 d^ = f e^^'^^fH) di = fix). 

-Z [-1/2,1/2]'* J 

Since the eigenf unctions (pi form an orthonormal basis for B, the factor rn{xj) in 
^\M is majorized by 

N N 00 

m{x,) = Y,\<Pi{^,)? = Y.\^(PuTx,s)\^ <Y.\{(PuTx,s)\' = \\T,^s\\l = 1. 
1=1 1=1 1=1 

Since T^ < Tj and the expectation preserves the cone of positive (semi) definite 
matrices (see, e.g. [22]), we have E (T/) < E (Tj) = R"^A , and 

E (X|) = E (T/) - R-'^'^A^ < R-'^A. 

(iii) Now the variance of the sum of positive (semi) definite random matrices is 
majorized by 



a2=||5]E(X|)||<||^E(T, 



jld II II - i^d • 
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End of the proof of Proposition Now we have all information to finish the proof 
of Proposition m Since Amin(T) = — Amax(— 7"), we substitute these estimates into 
the matrix Bernstein inequality with t = ru/R'^, and obtain that 



Combined with fllip . the proposition is proved. 



^2^2^-2d 



Random matrix theory offers several methods to obtain probability estimates 
for the spectrum of random matrices. In [2] we used the entropy method. We 
also mention the influential work of Rudelson [15] and the recent papers [TT[|T6] 
on random matrices with independent columns. The matrix Bernstein inequality 
offers a new approach and makes the probabilistic part of the argument almost 
painless. The matrix Bernstein inequality was first derived in pj and improved in 
several subsequent papers, in particular in [I^. The version with the best constants 
is due to Tropp [22] . Matrix Bernstein inequalities also simplify many probabilistic 
arguments in compressed sensing; see the forthcoming book |3]. 

3. From Sampling of Prolate Spheroidal Functions to Relevant 
Sampling of Bandlimited Functions 

Let a be the value of the A^-th eigenvalue of Ar, that is, a = Aat, let i? = Ejq be 
the orthogonal projections from B onto Vn, and let F = F/v = I — E^. Intuitively, 
since / G B{R, S) is essentially supported on the cube Cr, it should be close to the 
span of the largest eigenf unctions of An and thus Ff should be small. The following 
lemma gives a precise estimate. Compare also with the proof of j^l Thm. 3]. 

Lemma 5. If f E B{R,5), then 

6 



11^/112 >(l-Y 
\\Ef\\ln>a[l~j 



— a 
6 



a 

6 „„„, 



\\FfU<- „,„, 

1 — a 



Proof. Expand f E B with respect to the prolate spheroidal functions as / 

YlT=i^3't'j- Without loss of generality, we may assume that ||/||2 = ||c||2 = 1. Sin 
/ G B{R, 5), we have that 

» oo 

l-^<\\f\\lR= / \f{ttdt={AnfJ) = Y.\c,\%. 



'Cr 

Set 



j=i 



N 

2 



A=\\Ef\\l = Y,\c, 
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and B = ^,>^ |cjp = 1 — A = ||-F/||2- Since Xj < Xn = o for j > N, we estimate 
A=\\Ef\\l as follows: 

N N 

oo oo 

J=l j=N+l 



c/ 



> 1 - (5 - Aiv XI 

j=Af+l 

= l-6-a{l-A). 

The inequality A> 1 — (5 — a(l — A) implies that II-E/II2 = A > 1 — j^ and usin^ 
the orthogonal decomposition / = Ef + Ff, 

B=\\Ff\\l< ^ 



1 — a 
Finally, H-E/Hi^ij = J2j=i -^il^P ^ ^^^ ^ '^(^ ~ T^)' ^^ claimed. ■ 

REMARK (due to J.-L. Romero): As mentioned in [2], if / G B{R, 6) and /(xj) = 
for sufficiently many samples Xj G Cji, then / = 0. However, / cannot be 
completely determined by samples in Cr alone. This is a consequence of the fact 
that B{R, 6) is not a linear space. Given a finite subset S C Cr, consider the finite- 
dimensional subspace T-io of B spanned by the reproducing kernels T^s, x G S. If 
G T-Lq, then 0(x) = {(j),Txs) = for x G S*. Thus by adding a function in "Hfj- 
of sufficiently small norm to / G 'B(-R, 6), one obtains a different function with the 
same samples. More precisely, let / G B{R,6) with ||/||2 = 1 and f^ \f{x)\'^dx = 
■y > 1 — 6 and G "H^ with ||0||2 = 1. Then f{x) + e0(x) = f{x) for x G S' and 
f + e(j) & B{R, 6) for sufficiently small e > 0. 

Despite this non-uniqueness, one can approximate / from the samples up to an 
accuracy S, as is shown by the next lemma. 

We will require a standard estimate for sampled 2-norms, a so-called Plancherel- 
Polya-Nikolskij inequality [21]. Assume that X = {xj} C M'^ is relatively separated, 
i.e., the "covering index" 

max ^Xn{k+ [-1/2, 1/2]^^) =: No < 00 

is finite. Then there exists a constant k > 0, such that 

00 

(15) E 1/(^^)1' ^'^^oll/ll^ for all /gS. 

The constant k can be chosen as k = e'^'". Since the standard proof in [2l| uses a 
maximal inequality with an non-explicit constant, we will give a simple argument 
using Taylor series in the appendix. 
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Lemma 6. Let {xj : j = 1, . . . , r} be a finite subset of Cr with covering index Nq. 
Then the solution to the least square problem 

r 

(16) Popt = argmiiipg^^ {^\f{xj)-p{xj)\^j 
satisfies the error estimate 

(17) Yl \fi^j)-Popt{xj)\' < NoK-^WfWl for all f G B{R,6) . 
Proof. We combine Lemma [S] with flT^ . 

r r 



= J2\^f{x,)\'<^No\\Ff\\ 



o:^ \\J Il2 

1 — a 



Next we compare sampling inequalities for the space of prolate polynomials Vn 
to sampling inequahties for functions in B{R, 6). 

Lemma 7. Let {xj : j = 1, . . . ,r} be a finite subset of C^ with covering index Nq. 
If the inequality 

(18) ^E(b(-.)r-i?-ibiil^)>-|iibii^ 

holds for all p G Vn, then the inequality 



(19) Ei/(^^-)r>^ii/ii2 

holds for all f G B{R, 6) with a constant 

A = -—\a v\- 2kNq- 



Rd\ 1-a / 1-a 

REMARK: For A to be positive we need 



r> R 



^ 2kNq-^ 

1 — Q 
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Proof. Using the triangle inequality and the orthogonal decomposition / = Ef + 
Ff, we estimate 



j=l j=l j=l 

Taking squares and using (ITSl) on Ef and Ff in the cross product term, we continue 



as 



1/2 / J~^ \ 1/2 



i=i j=i j=i i=i 

r 

j=i 

r 
>El^/(^^-)l'-2'^^0p/||2||F/||2 

i=i 

>E|i^/(^.)P-2«:iVo^||/||^, 
i=i 

since by Lemma 0, II-F/II2 < Yz^ll/lli ^^^ H-E/lh < ll/lb- Now we make use of 
hypothesis ( TT8l) and Lemma [5] and obtain 

i=i i=i 

> j-JEfWl^ - -^||B/||= - 2nN„j—\\fg 

So we may choose A to be 



Rd\ 1-a J 1-a 



The final ingredient we need is a deviation inequality for the covering index 

No = max,^^4xj} n {k + [-1/2, 1/2]^). 

Lemma 8. Suppose R>2 and {xj : j = 1, . . . ,r} are independent and identically 
distributed random variables that are uniformly distributed over Cr. Let a > R~'^. 
Then 

P(iVo > ar) <{R + 2^ exp(-r (a log(ai?'^) - (a - R-'^)) 
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Proof. Let Dk = k+ [-1/2, 1/2]'^ for k e Z'^. Note that we need at most {R + 2^ 
of the -Dfc's to cover Cr. If A^o > o.^^ then for at least one k, D^ must contain at 
least ar of the Xj's. Therefore 

(20) F{No > ar) < {R + 2YmaxF{i^{xj} D Dk > ar). 



Fix k G Z'^. For any 6 > 0, by Chebyshev's inequality 

r r 

P(#{xJ nDk> ar) = p(^xd,(x,) > ar'^ = p(exp {bY,XD,{x,)) > e'"^^ 

i=i i=i 

r 

<e-''-Eexp(65^XD,(x,)). 

i=i 

Since the Xj are uniformly distributed over Cr, then Xn^i^j) is equal to 1 with 
probability at most i?"'^ and otherwise equals zero. Therefore, using the indepen- 
dence, 

r 

F(4^{xj} nDk> ar) < e"''"" JjEe^'^^fc^^^') 

< €-'"''{{1 - R-'^) + e^R-'^y = e^^'^"((l + (e'' - l)i?-'^)'' 

<e-''""(exp((e^-l)i?-<^))^ 
With the optimal choice b = \og{aR'^) the last term is then 

exp ( - r (a \og{aR'^) - (a - R''^))) . 
Substituting this in ( l20l) proves the lemma. ■ 

By combining the finite-dimensional result of Proposition [2] with the estimates 
of Lemmas [7] and [8] and the appropriate choice of the free parameters, we obtain 
the following theorem. 

Theorem 9. Let {xj : j E fi} he a sequence of independent and identically dis- 
tributed random variables that are uniformly distributed in Cr. Suppose R> 2, 

1 

^ < —, r, 

2(1 + 12k) ' 

and 

u < \-6{1 + 12k). 

Let 

(21) A = ^^(^\-5-v-mK 
Then the sampling inequality 

r 

(22) AWfWl < J2 l/(^.)l' < ^Il/ll2 for all f G B{R, 6) 

i=i 
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holds with probability at least 



(23) 1 - i?'^ exp - 



2 

u r 



R'^il + Z//3) 



j -(i? + 2)'^exp(--^(31og3-2) 



Proof. Since |/(a;)| < II/II2 for f & B, the right hand inequahty in ( l22l) is immediate. 
We take a = 1/2 and N = R^ in Proposition [2] and a = 3R~'^ in Lemma El Let 

i=i 

and let 

V2 = {No > ar}. 

By Proposition [2] and Lemma [HI the probabihty of {Vi U V2Y is bounded befow 
by (123]). By Lemma d 

^Ei/(^.)r>^ii/ii2 

i=i 
for all / e B{R, 6) on the set {Vi U ^2)'- With a = 1/2 and iVo = 3i?~^ the lower 
bound A of Lemma [7] simplifies to A = ;^(|~<^~^~ 12(5 /t j . Our assumptions 
on 6 and u guarantee that A > 0. ■ 

The formulation of Theorem [1] now follows. With N = R'^ and < z/ < 1/2 — 5 < 
1/2, if e > is given and 

(24) 



:>d 



1 + W3, 2R'^ R'^ , 2(i? + 2)'^\ „wl + z//3, 27?*^ 

^log ,^r^ — ^log^ -] =R ^log 

v-^ e 3 log 3 — 2 e / z/^ e 



then the probability in (1231) will be larger than 1 — e. 

REMARK: Observe that the parameters 6 and R are not independent. As men- 
tioned in P, p. 14], for B{R,6) to be non-empty, we need 6 > 2nV2Re~'^^ (up to 
terms of higher order) . Thus for small 6 as in Theorem M we need to choose R of 
order R f» c\og{d/6). 

Appendix A. The Plancherel-Polya inequality 

We finish by showing that the constant k, in the Plancherel-Polya inequality flTSl) 
can be chosen explicitly to be k = e'^^. The argument is simple and well-known, 
see, for example, j5]. 

Lemma 10. Let {xj : j E N} be a set in M*^ with covering index Nq. Then 






dwu _f ||2 
2 
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Proof. Let k e Z'^ andxj G k + [-l/2, 1/2] =: Dk. Then \\xj-k\\oo < 1/2. Consider 
the Taylor expansion of f{xj) at k (with the usual multi-index notation): 

We now let 6 G (0, 1) and apply Cauchy-Schwarz: 

(25) |/(,^.)P<E;^(i)*'E^^(^)'"™"' 

a>0 ■ a>0 



d/40 Y^ |^"/(fe)P nN2(i-e)|a| 



a. 

If / G i3, then by Shannon's sampling theorem (or because the reproducing 
kernels T^s, k G Z,'^, form an orthonormal basis of B) we have 

To estimate the partial derivatives we use Bernstein's inequality ||i5"/||2 < tt'"' ||/||2- 
We first assume that A'"o = 1, i.e., each cube Dfc contains at most one of the Xj's. 
Then we obtain, after interchanging the order of summation 

jeN a>o fee^d 



E(9 

(26) < e^/^^ Y: (i) 



a>0 



2\a\ 
d/A" \-^ (l\2{l-9)\a\'n^ II „||2 _ ^/4e rf^2/4l-e 2 



a>0 



2 — ^ ^ \\J 112 



The choice 4^ = 2/7r yields the constant k = e'^l^ e'^'^ 1^ = e^'^. For arbitrary 
A^o WG obtain 



El/(^^-)'l=E E l/(^.)'l<A^oe 



dn\l /||2 
2 5 



as claimed. 



Possibly the Plancherel-Polya inequality could be improved to a local estimate 
of the form J2x ec l/(^iP — ^^o||/||2,_R) but we did not pursue this question. 
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